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We examine a naming game with two agents trying to establish a common vocabulary for n 
objects. Such efforts lead to the emergence of language that allows for an efhcient communication and 
exhibits some degree of homonymy and synonymy. Although homonymy reduces the communication 
efficiency, it seems to be a dynamical trap that persists for a long, and perhaps indefinite, time. On 
the other hand, synonymy does not reduce the efficiency of communication, but appears to be only 
a transient feature of the language. Thus, in our model the role of synonymy decreases and in the 
long-time limit it becomes negligible. A similar rareness of synonymy is observed in present natural 
languages. The role of noise, that distorts the communicated words, is also examined. Although, in 
general, the noise reduces the communication efficiency, it also regroups the words so that they are 
more evenly distributed within the available "verbal" space. 
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I. INTRODUCTION 

Computational modelling is becoming more and more 
important tool to study langauge evolution [U, [3, 0, |3j @ ■ 
The central assumption of such an approach is that lan- 
guage is a complex adaptive system that emerges from 
local interactions between its users, and evolves and com- 
plexifies according to biological-like principles of evolu- 
tion and self-organization 6, X 8^ . This is by no means 
the only possibility since a number of researches claims 
that language does not have the adaptive values and is 
merely a byproduct of ha ying a large and complex brain 
or of some other skills [§, [lOf. Recently, however, adap- 
tationists got a strong support from Pinker and Bloom, 
who in their influential paper (llj argued that linguis- 
tic abilities require complex and costly adaptations (e.g., 
large brain, longer infancy period, descended larynx) and 
the language origin can be explained only by means of 
natural selection theory. 

Since language was invented only in one lineage, and is 
therefore unique to human species, its appearance has the 
same status as the origin of genetic code or the eukaryotic 
cell. The emergence of language was thus listed as one of 
the major transition in the evolution of life on Earth [l^ 
and it is certainly interesting to ask which factor is re- 
sponsible for it. Some claims were made that most likely 
it was the combination of selective evolutionary pressure 
and unique context that lead to the emergence of human 
language p^ . 

Language has also lead to the novel inheritance sys- 
tem [14| and opened up the possibility for cumulative cul- 
tural evolution and creation of complex society [l^ with 
collaboration of large non-kin groups fl6j . While our will- 
ingness to share information with relatives is rather easy 
to reconcile with darwinian evolution (kin-selection [l7j). 
linguistic interactions with non-kin individuals are harder 
to understand. Indeed, since speaking is costly (it takes 
time, energy and sometimes might expose a speaker to 
the predators), and listening is not, such a situation 



seems to favour selfish individuals that would only lis- 
ten but would not speak. Moreover, in the case of the 
conflict of interests the emerging communication system 
would be prone to misinformation or lying. A possible 
resolution of these problems is based on reciprocal altru- 
ism [l^. However, there is a growing evidence that co- 
operation and altruistic behaviour between humans are 
very complex and typically cannot be explained using 
standard reciprocal altruism arguments [l6|. 

As an alternative explanation Dessalles[l5 suggests 
that honest information is given freely because it is prof- 
itable - it is a way of competing for status within a group. 
In this context, an interesting computer simulations were 
made by Hurford(20|. He considered agents engaged in 
communicative tasks (one speaker and one hearer) and 
their abilities evolved with the genetic algorithm that was 
set to prefer either communicative or interpretative suc- 
cess. Only in the former case the emerging language was 
similar to natural languages where synonymy was rare 
and homonymy tolerated. When interpretative success 
was used as the basis of selection then the converse sit- 
uation (unknown in natural language) arose: homonymy 
was rare and synonymy tolerated. 

Indeed, synonymy in the pure variety is rare. Usu- 
ally, it can be found in two languages being in contact 
(napkin/serviette), handy abbreviations (bicycle/bike) or 
some specialized euphemistic domains related e.g., with 
sex (fuck/shag/ . . . ), death (croak/expire/. . . ) or bod- 
ily functions {shit/crap/ . . .). Linguists proposed various 
explanations of the human avoidance of synonymy. Clark 
attributes it to a presumably inborn tendency of humans 
to seek and create new meanings, rather than accept one 
meaning for several different forms [2l| . Markman notes 
that children have a tendency to assume that no two 
words may overlap in meaning [2^ . A similar point of 
view is expressed in Wexler's Uniqueness Principle which 
prevents the child from internalizing more than one form 
per meaning [23| . On the other hand, homonymy seems 
to be more common in natural languages. One can eas- 
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ily think of many words having multiple and unrelated 
meanings (e.g., abstract, compound, second, present). At 
first sight one can consider this as surprising since syn- 
onymy does not diminish communicative efficiency but 
homonymy in principle does. Let us also notice that 
computer languages quite often accept synonymy (e.g., 
aliases in command systems) but typically do not handle 
homonymy. 

In our opinion an apparent asymmetry between rare 
synonymy and relatively common homonymy is an im- 
portant and generic feature of natural languages and 
might be used as a test of various computational mod- 
els of language development. In the present paper we 
examine a version of the Steels naming game model [l^ 
where two agents exchange information concerning a cer- 
tain number of facts/objects from their reality and try 
to establish a common vocabulary. The emerging lan- 
guage features some degree of homonymy and synonymy. 
Although homonymy diminishes the communicative ef- 
ficiency it turns out to be a persistent feature of the 
language. On the other hand synonymy is only a tran- 
sient feature of the language and its frequency of ap- 
pearance diminishes over time. The asymmetry between 
homonymy and synonymy can be thus understood within 
a rather simple naming-game setup, without revoking 
evolutionary arguments that speaker more than hearer 
benefits from the conversation Let us also notice 

that stable homonymy and transient synonymy was also 
reporte d by Puglisi et al. in a model of formation of cat- 
egories 2JI . We also examine the role of noise that might 



distort communicated words. Our results show that noise 
plays (or played) an important role and could affect the 
distribution of words in a "verbal" space. 



II. MODEL 

More than a decade ago Steels proposed the naming 
game model, that quickly became one of the basic mod- 
els of the emergence of linguistic coherence 25| . In this 
model we have a group of agents that communicate with 
each other trying to establish a common vocabulary on 
a certain number of objects. Typically, after some time, 
they reach a state of linguistic coherence where they to 
large extent (or even perfectly) understand each other. 
In the original formulation the naming game model de- 
scribes cultural transmission within a single generation 
of agents. Evolutionary versions with mutations and se- 
lections of agents taking place were also studied [ 26l , [27| . 
In most works on the naming game model only a sim- 
ple structure of the emerging language is allowed and 
homonymy is very often excluded j28l. l29l Isd^. Such ap- 
proaches effectively can be regarded as if agents would 
talk on a single object [s^l- Although it drastically sim- 
plifies the language structure such an approach allows 
to consider many agents and thus to take into account 
some elements of the social structure [l^. But such 
works constitute only one end point of the computational 



dilemma: many agents of simple-architecture versus few 
but with complex-architecture. At the other side we 
have models of few agents but able to develop language 
of much larger complexity. To examine linguistic struc- 
tures like homonyms or synonyms, one has to consider 
an n-object version of the naming game model. Some 
results on n-object naming game model have been al- 
ready reported ^31, 32]. Let us also notice that the main 
emphasis in the naming game model is on the cultural 
(single-generation) transmission of language. An alter- 
native approach to the language evolution were inter- 
generational interactions play an important role is called 
Iterated Learning Model and was used in various con- 
texts Hi]. 

Our model is a two-agent version of the naming game. 
It is assumed that agents are embodied in a shared 
environment and communicate on a certain number of 
facts/objects from this environment. Agents in turns 
take the role of speaker and hearer. Speaker selects an ob- 
ject from the environment. Then, using its form-meaning 
relations, speaker selects a word that is assigned to the 
object. The word is communicated to hearer, that uses its 
own meaning-form relations to guess the communicated 
meaning. We also assume that after such a communi- 
cation attempt there is a possibility to check (e.g., by 
pointing at the object) whether hearer guessed the com- 
municated meaning correctly. Established in such a way 
success or failure modifies the structure of meaning-form 
relations of agents to facilitate future communication at- 
tempts. 

Both agents refer to the common set of n objects and 
with each object each agent relates the corresponding 
inventory (inventories are numbered from 1 to n). 
Each inventory stores up to / words that are used to 
describe the corresponding object. With each word in 
a repository the weight w is associated that controls 
the stochastic process of selecting a communicated word 
(speaker) and decoding the meaning (hearer). The idea 
of assigning weights to words was already used in some 
naming game models (32| . For computational purposes 
the words are represented by integer numbers from 1 
to r but more natural representations using strings of 
letters are also possible. The parameter r can be thus 
interpreted as corresponding to the capacity of the 
"verbal space". More detailed rules of our model are 
specified below: 



• Speaker randomly selects an object. From the 
inventory that corresponds to the selected object 
speaker selects the communicated word Xc- The 
word is selected taking into account the weights 
corresponding to each word in this inventory. We 
used the method of roulette selection. 

• Hearer tries to guess the meaning of the commu- 
nicated word and decodes it. To do that, hearer 
first calculates measures of similarity s'^{xc) of 
the communicated word with k-th inventory (fc = 



l,2,...,n). The measures s'^^Xc), that are calcu- 
lated using the following formula 
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are then used to select the inventory that fits the 
communicated word (roulette selection again but 
with s'^{xc) as a weight of an inventory). In Eq. ([1]) 
Xi and Wi are the i-th word and its weight, respec- 
tively, and the summation is over all elements of the 
k-th inventory (numerated with i). The closer Xi 
to the Xc is, the larger its contributions to the simi- 
larity measure s''{xc) are. The role of e in Eq. ([T|) is 
to keep s'^{xc) finite even when the communicated 
word is the same as one of the words in the k-th 
inventory. Having calculated s''{xc) for all invento- 
ries, hearer uses the roulette selection to choose the 
inventory that fits the communicated word. Since 
in our calculations e takes rather small values, in- 
ventories that contain a communicated word (or 
words that are very close to it) get large similar- 
ity measures and have larger probabilities of being 
selected. 

— When the inventory selected by the hearer has 
the same number as that selected by speaker, 
we consider this as a communicative success. 
In such a case both agents increase the weights 
associated with the communicated word by 
one. If in the hearer inventory there is no such 
a word (but it still has decoded the meaning 
correctly) we add the communicated word to 
this inventory with unit weight (if the inven- 
tory contains already I elements we first re- 
move the word with the smallest weight). 

— When the inventory selected by the hearer 
has a different number than that selected by 
the speaker we consider this as a communica- 
tive failure. In such a case speaker decreases 
the weight associated with the communicated 
word by one. Hearer inspects its inventory 
that has the same number as that selected 
by the speaker. If it contains the communi- 
cated word, its weight is increased. Other- 
wise, hearer adds the word to this inventory 
with unit weight. 

Our simulations show that the model is relatively 
robust and small changes of its rules or of values 
of parameters do not change much the behaviour 
of the model. In particular, similar results are ob- 
tained when an increase or decrease of weights in 
the case of success or failure is done either with a 
fixed or weight-dependent amount (e.g. the larger 
the weight, the smaller the increase). Let us also 
notice that the increase or decrease of weight in 
the case of success or failure, respectively, resem- 
bles the reinforcement learning approach and some 
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FIG. 1; A communication attempt with n = 4 objects. 
Speaker selected the second object and one of the words that 
are associated with this object. The selected word is commu- 
nicated to the hearer that then decodes its meaning. Since the 
decoded object is the same as that chosen by the speaker, the 
above example is considered as a success. The selection of the 
communicated word and its decoding are stochastic in nature 
(see the main text) and controlled by weights w associated 
with each word. 



par am. 


description (values used in simulations) 


n 


number of objects (100 < n < 10"^) 


1 


memory size - maximum number of words 
corresponding to an object {5 < I < 20) 


r 


words - positive integer numbers not greater than r 
(500 <r < 10*) 


e 


Ensures that similarity measure in Eq. ^ 
is finite (10"^ < e < 10"^) 


p, a 


parameters describing noise 
(see Eq.[2j {0 < p < 0.05, < a < 10) 



TABLE I: Parameters of the model and ranges of values used 
in the simulations. 



naming game models with a similar dynamics have 
been already examined [31l |. 

• In some of our simulations we have examined the ef- 
fect of noise that distorts the communicated word. 
More precisely, we assume that with the probabil- 
ity p the communicated word chosen by speaker 
becomes 



X + 1] 



(2) 



where 77 is a random integer number uniformly 
drawn from the interval < —a, a > and a is the 
amplitude of noise (with the probability I — p the 
communicated word does not change). If x calcu- 
lated using Eq. ([2]) happens to be outside the range 
< 1, r >, a different instance of rj is generated. 

An example that illustrates the above rules is shown 
in Fig.[TJ Table U collects all parameters of the model. 
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III. NUMERICAL CALCULATIONS 

To start the simulations an initial configuration is 
needed. We assume that at the beginning each agent 
has in each inventory a single word (randomly selected 
from the interval < 1, r >) with unit weight. To examine 
the behaviour of the model we measured various quanti- 
ties that in some cases were averaged over certain time 
intervals or over independent runs.( We define the unit of 
time as corresponding to 2n communication attempts.) 
Of particular interest is the communicative success rate 
of an agent, that is defined as a fraction of successful com- 
munication attempts. Some other quantities that allow 
us to analyze in more details the structure of the emerg- 
ing language and of the communication process will be 
specified later. 



A. Basic properties 

Simulations show that typically the agents correlate 
their inventories so that their communication maintains 
a rather large success rate (Fig[2Ji). Of our further inter- 
est will be words that in a given inventory have the largest 
weight. Since some of them might be the same for dif- 
ferent inventories, we calculated the number of different 
largest- weight words in the resulting language. It turned 
out that this number is close to the the number of objects 
n {P'lgWp) a-iid most of the communication attempts use 
these largest-weight words (Fig[5J:). It means that in ma- 
jority of cases communication between agents proceeds as 
follows: Speaker selects an object and the largest-weight 
word from the inventory corresponding to this object be- 
comes the communicated word. For small e the simi- 
larity measure, as calculated from Eq.([T]), is large only 
for the inventory that contains the communicated word 
(provided that the weight of this word is not very small). 
Usually, it happens to be the inventory corresponding to 
the same object as selected by speaker and thus such an 
attempt is successful. For larger e (^0.1) the communi- 
cation between agents deteriorates and both the success 
rate and the number of different largest-weight words di- 
minish. 

There are two factors that contribute to the commu- 
nication failure. First, it is the finite number of e that 
implies that similarity measure ([1]) for different words 
is positive and thus selection of the inventory made by 
the hearer might lead to communication failure. Sec- 
ond, homonyms, that as we shall see might appear in 
our model, also might lead to the communication failure. 
Their role is discussed in detail in the next subsection. 

In Fig. [3] we present the distribution of largest- and 
second-largest-weight words that is established after a 
sufficiently long transient. Relatively uniform distribu- 
tion indicates that these words are uncorrelated. Since 
some of the second-largest weight words, as discussed be- 
low, might be considered as synonyms (of the largest- 
weight words), the lack of correlations agrees with the 
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FIG. 2: The time evolution of (a) the success rate; (b) the 
number of different largest-weight words; and (c) the fraction 
of second-largest-weight utterances. Calculations were made 
for n = 500, « = 10 and r = 10^ 



observation that synonyms in natural languages are not 
similar to each other. 



B. Homonymy and synonymy 

Since agents communicate on more than one object 
the resulting language might contain homonyms and syn- 
onyms. Homonymy appears when a word can be associ- 
ated with more than one objects and synonymy when an 
object can be associated with several words. However, 
the rules of our model contain probabilistic factors and so 
the definition of homonymy and synonymy must take this 
fact into account. We define homonymy as a word that 
with a relatively large probability can be associated with 
several objects. Typically such a situation occurs when 
a word uttered by the speaker appears in more than one 
inventories of the hearer as the largest-weight word. Con- 
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FIG. 3: The distribution of largest- weight words and the 
largest-weight words after simulations of time t = 10^. Cal- 
culations were made for n = 500, r — 10^, e = 10~^, and 
and / = 10. Different plotting symbols (circles, crosses) cor- 
respond to different agents. Quite often both agents have in 
some inventories (that usually corresponds to the same ob- 
ject) the same largest- and second-largest-weight words and 
in such a case the plotted symbols overlap. 
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FIG. 5: The time evolution of the number of different largest- 
weight words for n = 500, / — 10 and e — 10~^. Small 
squares at the t — axis indicate the values for the randomly 
drawn words (see text). One can notice that during simu- 
lations a redistribution of largest-weight words takes place 
and that reduces the number of homonyms in the language. 
However, the number of largest-weight words saturates be- 
low n and that shows that homonyms are a persistent feature 
of language. For large range r the (almost) homonymy-free 
language is obtained. 
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FIG. 4: Homonymy occurs when a word (1244) can be asso- 
ciated with more then one object. Synonymy occurs when an 
object is associated with more than one word. 



sequently, the number of different largest-weight "words 
is a measure of homonymy of the language (the smaller 
this number is, the more frequent the homonymy is). 
Analogously, synonymy most often occurs "when speaker 
and hearer in their inventories corresponding to a certain 
object have the same largest- and second-largest-"weight 
"words. In such a case, no matter "which of them is selected 
for communication, it is quite probable that the meaning 
will be guessed correctly. Examples of such situations are 
shown in Fig. ID 

Since homonymy typically occurs when more than one 
inventory has the same largest-weight word, we exam- 
ined in more detail the number of different largest-weight 
words and the results are shown in Fig.O One can notice 



that as the interval r from which the words are drawn in- 
creases, this number tends to the number of objects n, 
and that means that homonyms become less frequent. 
This is because for large r there are many words to chose 
from and the probability that two inventories have the 
same largest- weight word decreases. Let us notice, how- 
ever, that homonymy might appear also in some naming 
game models with an unbounded reservoir of words (in 
our case it corrsponds to the limit r oo) [sl ]. 

Naively, one might expect that the number of differ- 
ent largest-weight words can be obtained from the simple 
probabilistic arguments: let us select randomly n num- 
bers from the interval < l,r > and check how many of 
them are different. We did such calculations and numer- 
ical results are also shown in Fig. [5] (small squares along 
the t — axis) . One can notice that this agrees with sim- 
ulations but only initially. The subsequent evolution of 
the model changes the initial distribution and the number 
of different largest-weight words increases in time. Since 
success rate and the number of different largest-weight 
words behave similarly (Fig. [5^,b), such a redistribution 
reduces homonymy and enables more efhcient communi- 
cation between agents. However, saturation below the 
maximal value (seen in Fig. [5]), equal to the number of 
objects n, indicates that homonymy is a persistent fea- 
ture of language. 

Fig. [2j: shows that a fraction of communication at- 
tempts is made with the second-largest weight words. 
When such an attempt is successful it usually means that 
there is more than one word that is associated with a 
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FIG. 6; The time evolution of the success rate of utterances 
with largest- and second-largest- weight words. Relatively 
large success rate of utterances with second-largest weight 
words indicates that more than one word can be associated 
with some objects, i.e., some words can be treated as syn- 
onyms. Increasing in time fluctuations of the second-largest 
weight data are due to poor statistics caused by the decreas- 
ing number of such utterances (i.e, synonymy decreases over 
time). Simulations were made for n = 500, r = 10^, Z = 10, 
and e = 10-^ 
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homonymous utterances remains constant. On the other 
hand the frequency of synonymous utterances decreases 
in time. 

Provided that the model bears some similarity to the 
evolution of natural languages, one can expect that in 
present-day languages, that correspond to the long-time 
limit of the language that emerge in our model, syn- 
onymy, in agreement with some observations, would be 
rare. It was already suggested by Hurford [20| that 
rareness of synonymy is caused by the asymmetry of 
evolutionary benefits between speaker and hearer. Let 
us emphasize that our model uses only cultural (single- 
generation) mechanisms for the evolution of language. 
The results thus show that understanding of some basic 
features of homonymy and synonymy can be obtained 
within a much simpler model that does not take into ac- 
count any evolutionary effects. Let us also notice that 
persistent homonymy and transient synonymy was also 
reported by Puglisi et al. [23] in an interesting model of 
category formation. In their model, that also includes 
cultural dynamics only, agents are exposed to the con- 
tinuous environment (i.e., with an infinite number of ob- 
jects) and such a behaviour was observed even when the 
reservoir of possible words is unbounded. As a matter 
of fact transient nature of synonymy is a more generic 
property of the Naming Game, although in some models 
persistent synonymy was reported ^35j . 



given object, which for our purposes defines synonymy. 
That such words do ensure a relatively large success rate 
is confirmed in Fig.[6l where the time evolution of the suc- 
cess rate of utterances with largest- and second-largest- 
weight words is shown. Indeed, relatively large success 
rate of utterances with second-largest weight words in- 
dicates that more than one word can be associated with 
some objects, i.e., some words can be treated as syn- 
onymous. However, the decrease of frequency of second- 
largest-weight utterances seen in Fig. [2j; and (related 
with that) large fluctuations seen in Fig.[B]show that the 
role of synonyms diminish in time. In the long-time limit 
synonymous second-largest-weight words become irrele- 
vant since entire communication proceeds with largest- 
weight words only. 

A trace of synonymy can be also seen in Fig. [31 Indeed, 
overlapping plotting symbols (circles and crosses) show 
that both agents have a substantial fraction of the same 
largest- and second-largest-weight words in correspond- 
ing inventories. This plot, however, does not tell us that 
for many of these pairs, the weight of the largest-weight 
word is so much dominant that other words from this 
inventory are essentially negligible (since they are never 
used), especially after long simulations. Although quanti- 
tative estimation of the role of homonymy and synonymy 
depends on parameters, some generic behaviour seems 
to characterize our model. In particular, homonymy, al- 
though rare for large r, is a persistent feature of the lan- 
guage: except for the initial time interval, frequency of 



C. The effect of noise and distribution of words 

All calculations reported so far were made for the 
noiseless case {p = 0), i.e., under the assumption that 
communication of a word to another agent is perfect and 
cannot change the word. Now we relax this assumption 
and examine the role of noise that might distort the com- 
municated word as specified in Eq. In our opinion, 
especially at early stages of the evolution of language 
communication could be exposed to such a disturbance. 

Because of noise, the received word might be differ- 
ent than that uttered by the speaker. If the difference is 
small, the hearer might still correctly decode it. We ex- 
pect that this will be often the case when the amplitude 
of noise is small or the largest-weight words are well sep- 
arated so that the small change does not lead to the over- 
lap with some similar words. As we have already noticed 
(Fig. O, during the evolution of the model a redistri- 
bution of largest-weight words takes place, that reduces 
homonymy and improves communication between agents. 
Fig. [7] shows that noise greatly magnifies such a redistri- 
bution. In this figure we present the distribution of dis- 
tances d between neighbouring largest-weight words com- 
pared with the distribution where largest-weight words 
are selected randomly. One can notice that noise leads 
to the more even distribution (within the available range) 
with substantially reduced number of overlaps {d — 0) as 
well as of large voids. 

Noise also changes the distribution of second-largest- 
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FIG. 7: The average number of intervals N{d) of a given dis- 
tance d between neighbouring largest-weight words. Calcula- 
tions were made for n = 500 r = 10^, I = 10, and e = 10~^. 
This data show that noise greatly magnifies redistribution of 
words so that they are more evenly distributed within the 
available range (overlaps and large distances between words 
are much less likely, comparing to the distribution where they 
are selected independently). 
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FIG. 8: The distribution of largest- weight words and the 
second-largest-weight words after t = 10^ simulations. Calcu- 
lations were made for n — 500, r = 10"^, I — 10, e = 10~^, and 
ri = 0.05. Different plotting symbols correspond to different 
agents. 

weight words. Accumulation of points along the diagonal 
line seen in Fig. [S] shows that in presence of noise the 
second largest-weight words are very often close to the 
largest-weight words. In such a case they should not be 
considered as synonyms (that are usually much different) 
but as the same words but e.g., with a slightly modified 
pronunciation. When noise is absent there is no such 
accumulation (Fig. [3]). 

It is possible that noise played an important role in the 
evolution of language and helped to redistribute words 



within available phonetic space (Fig. [7|) and/or reduced 
the number of synonyms (Fig. [5]). Actually, it would be 
interesting to obtain the analogue of the distribution of 
distances between words shown in Fig. [3 but obtained 
for natural languages. Although the very definition of 
distance between words remains under debate, various 
algorithms of mainly phonetic comparison are already in 
use [1^ and some statistical analysis in principle could 
be made. 



IV. CONCLUSIONS 

In the present paper we studied an n-object naming 
game between two agents and examined the structure 
of the emergent common vocabulary. Our results show 
that after an initial transient a linguistic synchronization 
is reached and efhcient communication of agents is es- 
tablished: speaker selects an object and a corresponding 
word that is communicated to hearer that usually cor- 
rectly decodes the intended meaning. Our main results 
are twofold: 

• A small fraction of communication attempts use 
homonyms or synonyms. Although homonyms re- 
duce the efficiency of communication they appear 
to be a rather persistent feature of the language. 
On the other hand, synonyms do not reduce such 
an efficiency but are gradually expelled from the 
language. The model supports thus the observa- 
tion that nowadays in natural languages synonyms 
are rare, but related observations were also made in 
another type of models by Puglisi et al. [l^l . More- 
over, it seems to us that the present model, that 
has only one generation of agents and does not re- 
fer to the notion of fitness is simpler than that used 
by Hurford where the rareness of synonymy 
was attributed to the asymmetry of payoff between 
speaker and hearer. 

• The second main result is to show that noise plays 
(or played) an important role in the evolution of 
language. It enhanced redistribution of words and 
probably contributed to the reduction of synonymy 
of the language. 

It would be desirable to extend our model to a multi- 
agent. Let us notice, however, that such simulations are 
likely to be computationally very demanding (in such a 
case the dynamics of the model will be slower and the 
amount of calculations needed for the model to reach 
the linguistic synchronization will be much larger). Ad- 
ditional problem might be related with examining the 
structure of emerging language and the present paper 
shows that even the preparatory (two-agent) version pro- 
vides rich and nontrivial behaviour. Since, however, 
human linguistic interactions take place in multi-agent 
regime, such an extension should be examined. Another 
possibility might be to introduce a fitness function and 
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implement evolutionary changes that in some versions 
of naming game models are known to result in qualita- 
tively novel behaviour [2^1 ■ Let us notice that our model 
neglects, among others, sound-merging effects as well as 
interactions of a given language with spatially neighbour- 
ing languages. Such factors often provide an important 
source of homonyms and synonyms. These factors might 
be taken into account in a multi-agent version of our 
model. Morever, it would be interesting to examine a 
situation where the number of objects n could differ from 



the number of inventories, would depend on an agent and 
in addition would be determined in some dynamical pro- 
cess of category formation as in the paper of Puglisi et 
al. H. 
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